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A METHOD FOR IN VITRO MOLECULAR EVOLUTION 
OF PROTEIN FUNCTION 

Field of the invention 

The present invention relates to a method for in vitro molecular 
evolution of protein function, in particular by shuffling of single stranded DNA 
segments obtained using an exonuclease. 

Background of the Invention 

Protein function can be modified and improved in vitro by a variety of 
methods, including site directed mutagenesis (Alber et a/, Nature, 5; 
330(61 43);41 -46, 1987) combinatorial cloning (Huse et a/, Science, 
246:1275-1281, 1989; Marks et a/, Biotechnology, 10: 779-783, 1992) and 
random mutagenesis combined with appropriate selection systems (Barbas et 
a/, PNAS. USA, 89: 4457-4461, 1992). 

The method of random mutagenesis together with selection has been 
used In a number of cases to improve protein function and two different 
strategies exist. Firstly, randomisation of the entire gene sequence in 
cpmbination with the selection of a variant (mutant) protein with the desired 
characteristics, followed by a new round of random mutagenesis and 
selection. This method can then be repeated until a protein variant is found 
which is considered optimal (Schier R. et a/, J. Mol. Biol. 1996 263 (4): 551- 
567). Here, the traditional route to introduce mutations is by error prone PCR 
(Leung et a/, Technique, 1: 11-15, 1989) with a mutation rate of 
approximately 0.7%. Secondly, defined regions of the gene can be 
mutagenized with degenerate primers, which allows for mutation rates up to 
100% (Griffiths ef a/, EMBO. J, 13: 3245-3260, 1994; Yang et a/, J. Mol. Biol. 
254: 392-403, 1995). The higher the mutation rate used, the more limited the 
region of the gene that can be subjected to mutations. 

Random mutation has been used extensively in the field of antibody 
engineering. In vivo formed antibody genes can be cloned in vitro (Larrick ef 
a/, Biochem. Biophys. Res. Commun. 160: 1250-1256, 1989) and random 
combinations of the genes encoding the variable heavy and light genes can 
be subjected to selection (Marks et a/, Biotechnology, 10: 779-783, 1992). 
Functional antibody fragments selected can be further improved using 
random mutagenesis and additional rounds of selections (Schier R. et a/, J. 
Mol. Biol. 1996 263 (4): 551-567). 

The strategy of random mutagenesis is followed by selection. Variants 
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with interesting characteristics can be selected and the mutated DNA regions 
from different variants, each with Interesting characteristics, are combined 
into one coding sequence (Yang ef a/ ( J. Mol. Biol. 254: 392-403, 1995), This 
is a multi-step sequential process, and potential synergistic effects of different 
mutations in different regions can be lost, since they are not subjected to 
selection in combination. Thus, these two strategies do not include 
simultaneous mutagenesis of defined regions and selection of a combination 
of these regions. Another process involves combinatorial pairing of genes 
which can be used to improve eg antibody affinity (Marks et a/, Biotechnology, 
10: 779-783, 1992). Here, the three CDR-regions in each variable gene are 
fixed and this technology does not allow for shuffling of individual gene 
segments in the gene for the variable domain, for example, including the CDR 

regions, between clones. 

The concept of DNA shuffling (Stemmer, Nature 370: 389-391, 1994) 
utilizes random fragmentation of DNA and assembly of fragments into a 
functional coding sequence. In this process it is possible to introduce 
chemically synthesized DNA sequences and in this way target variation to 
defined places in the gene which DNA sequence is known (Crameri et a/, 
Biotechniques, 18: 194-196, 1995). Stemmer and coworkers developed this 
in vitro method, which reassemble the normally occurring evolution process of 
protein in nature. The DNA shuffling generates diversity by recombination, 
combining useful mutations from individual genes. It has been used 
successfully for artificial evolution of different proteins, e.g. enzymes and 
cytokines (Chang et al. Nature Biotech. 17, 793-797, 1999; Zhang et al. 
Proc. Natl. Acad. Sci. USA 94, 4504-4509,1997; Christians et a/. Nature 
Biotech. 17, 259-264, 1999). The genes are randomly fragmented using 
DNAse I and then reassembled by recombination with each other. The 
starting material can be either a single gene (first randomly mutated using 
errror-prone PCR) or naturally occurring homologous sequences so called 
family shuffling. The optimal enzyme for fragmentation of genetic material is 
an enzyme, which can generate different sizes of DNA in a fast and easy 
way. DNAse I hydrolyses DNA preferentially at sites adjacent to pyrimidine 
nucleotides, therefore it Is a suitable choice for random fragmentation of DNA. 
However, the activity is dependent on Mg or Mn ions, Mg ions restrict the 
fragment size to 50bp, while the Mn ions will give fragment sizes less than 
50bp. Therefore, in order to have all possible sizes for recombination the 
gene in question needs to be treated at least twice with DNAse I in the 
presence of either of the two different ions, followed by removal of these very 
same ions. 

Using the unique features of BAL31 nuclease in the DNA shuffling 
process i.e. the FIND technology (Fragment Induced Nucleotide Diversity), 
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renders the ability of a fast, easy and controllable system. This enzyme can 
give all sizes of gene fragments and the activity of the enzyme can be easily 
controlled by stopping the digestion at various time points. BAL 31 is 
predominately a 3' exonuclease that removes mononucleotides from both 3' 
termini of the two strands of a linear DNA. BAL 31 Is also an endonuclease; 
thus the single-stranded DNA generated by the 3' exonuclease activity is 
degraded by the endonuclease. The 3' exonuclease activity of the enzyme 
works about 20-fold more efficiently than the endonuclease. The enzyme 
concentrations are therefore important for the obtained DNA fragments. High 
concentration of enzyme favors blunt-ended DNA whereas at low 
concentrations the single-stranded DNA termini may be very long. BAL 31 
consists of two kinetically distinct forms of the enzyme, a fast (F) and a slow 
(S) form. The S form is a proteolytic degradation product of the F form. 
Furthermore, BAL 31 works asynchronously, generating a population of DNA 
molecules whose termini have been resected to various extents and whose 
single-stranded tails vary in length. The enzyme uses Ca 2+ as a co-factor 
which can be bound in complex with EGTA (Ethylene Glycol bis (p-amino 
ethyl Ether) N,N,N\N'-tetra acetic acid). Linear DNA sequences are digested 
with BAL31 and the reaction stopped at different time points by the addition of 
EGTA. 

In theory, it is possible to shuffle DNA between any clones. However, 
if the resulting shuffled gene is to be functional with respect to expression and 
activity, the clones to be shuffled have preferably to be related or even 
identical with the exception of a low level of random mutations. DNA shuffling 
between genetically different clones will generally produce non-functional 
genes. However, it has been proven by the methodology of ITCH Y that 
interspecies fusion libraries can be created between fragments of the E. coli 
and human glycinamide ribonucleotide transformylase genes, which have 
only 50% Identity on the DNA level (Ostermeier et a/, Nat Biotechnol 17, 
1205-9, 1999). 

A successful recombination of two different genes requires formation of 
hetero-duplex molecules. In some cases the family shuffling almost only form 
homo-duplexes resulting in a low frequency of recombination.' The problem 
can be circumvented by using single-stranded DNA as starting material 
(Kikuchi et al. 2000) or by using restriction enzymes for the digestion (Kikuchi 
et a/. 1999). The latter gives very efficient recombination but the variation of 
gene fragments generated is limited compared to the one obtained from 
DNAse I treatment. An alternative enzyme for the digestion is the BAL 31 
nuclease. 

Single-stranded DNA can be obtained in essentially two different ways. 
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Firstly, by the use of biotinylated primers in the PCR reactions (AffiniTip 
Streptavidin Capture Micro-columns, Manufacturers Manual). Secondly, by 
utilising bacteriophage that are able to pack single-stranded DNA. (Viruses 
and Related Entitles in Modern Microbiology, Principles and Applications 
pp.171-192, Ed. E.A. Birge, Wm. C, Brown Publishers1992; Maniatis et al. 
Molecular Cloning, A laboratory manual 2 nd edition. Cold Spring Habor 
Laboratory Press. 1989) 

Selection of enzymes with altered and Improved properties are often 
based on the actual function of the enzyme. Increased thermostability can be 
selected for by incubating transformant colonies at temperatures that cause 
inactivation of wild type enzyme and p-glucosidase activity can be measured 
( "\ by using PNPG as the subst^te (Arrizubieta et a/ J Biol Chem Jun 27, 2000) 

Selection of functional proteins from molecular libraries has-been /V 
*e\^<ritontee^ phage display technology (Parmley ^'C3 

efa/, Gene, 73: 305-391 1988; McCafferty et al, Nature, 348: 552-554, 1990; 
Barbas et al, PNAS. USA, 88: 7978-7982, 1991). Here, the phenotype 
(protein) is directly linked to its corresponding genotype (DNA) and this allows 
for directly cloning of the genetic material which can then be subjected to 
further modifications in order to improve protein function. Phage display has 
been used to clone functional binders from a variety of molecular libraries with 
up to 10 11 transformants in size (Griffiths et a/, EMBO. J. 13: 3245-3260, 
1994). Thus, phage display can be used to directly clone functional binders 
from molecular libraries, and can also be used to improve further the clones 
originally selected. Other types viruses that have been used for surface 
expression of protein libraries and selections thereof are baculovirus (Boubllk 
et al Biotecnol 13:1079-1084. 1995; Mottershead et al Biochem Biophys Res 
Com 238:717-722, 1997; Grabherr et al Biotechniques 22:730-735, 1997) 
and retrovirus (Buchholz et al Nature Biotechnol 16:951-954, 1998). 

Selection of functional proteins from molecular libraries can also be 
performed by cell surface display. Also here, the phenotype is directly linked 
to Its corresponding genotype. Bacterial cell surface display has been used 
for eg. screening of improved variants of carbozymethyl cellulase (CMCase) 
(Kim et al Appl Environ Microbiok 66:788-93, 2000). Other cells that can be 
used for this purpose are yeast cells (Boder and Wittrup Nat. Biotechnol 
15:553-557, 1997), COS cells (HIguchi et al J Immunol Meth 202:193-204, 
1997), and insect cells (Granzerio et al J Immunol Meth 203:131-139, 1997; 
Ernst et al Nucleic Acids Res 26:1718-1723, 1998). 

Random combination of DNA from different mutated clones In 
combination with selection of desired function is a more efficient way to 
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search through sequence space as compared to sequential selection and 
combination of selected clones. 

This invention aims at combining the unique features of the FIND 
technology with the concept of single-stranded DNA. This will render a 
controllable, fast and easy technology to obtain molecules with improved 
properties. The unique combination of techniques will most probably give 
unexpected advantages such as more efficient recombination and shuffling, 
which will give rise to more versions of altered molecules and thereby a 
greater probability to find the molecule with exactly the desired properties. 

Summary of the invention 

According to one aspect of the present invention, there Is provided a 
method for generating a polynucleotide sequence or population of sequences 
from parent single stranded polynucleotide sequences encoding one or more 
protein motifs, comprising the steps of 

a) providing single stranded DNA constituting plus and minus strands of 
parent polynucleotide sequences; 

b) digesting the single stranded polynucleotide sequences with an 
exonuclease to generate populations of single stranded fragments; 

c) contacting said fragments generated from the plus strands with 
fragments generated from the minus strands and optionally add 
primer sequences that anneal to the 3 'and 5 'ends of at least one 

of the parent polynucleotides under annealing conditions; 

d) amplifying the fragments that anneal to each other to generate at least 
one polynucleotide sequence encoding One or more protein motifs 
having altered characteristics as compared to the one or more 
protein motifs encoded by said parent polynucleotides. 

Therefore, typically, there is provided a method of combining 
polynucleotide fragments to generate a polynucleotide sequence or 
population of sequences of desired characteristics, which method comprises 
the steps of: 

(a) digesting a linear parent single-stranded polynucleotide 
encoding one or more protein motifs with an exonuclease to generate a 
population of single-stranded fragments of varying lengths; 

(b) assembling a polynucleotide sequence from the sequences 
derived from step (a). 

Preferably the method further comprises the step of (c) expressing the 
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resulting protein encoded by the assembled polynucleotide sequence and 
screening the protein for desired characteristics. 

By controlling the reaction time of the exonuciease the size of the 
polynucleotide fragments may be determined. Determining the lengths of the 
polynucleotide fragments in this way avoids the necessity of having to provide 
a further step such as purifying the fragments of desired length from a gel. 

In order to generate a polynucleotide sequence of desired 
characteristics the parent polynucleotide encoding one or more protein motifs 
may be subjected to mutagenesis to create a plurality of differently mutated 
derivatives thereof. Likewise, a parent polynucleotide may be obtained 
already encoding a plurality of variant protein motifs of unknown sequence. 

Random mutation can be accomplished by any conventional method 
as described above, but a suitable method is error-prone PGR. 

It is preferable to use PCR technology to assemble the single-stranded 
polynucleotide fragments into a double-stranded polynucleotide sequence. 

The polynucleotide sequence is preferably DNA although RNA may be 
used. For simplicity the term polynucleotide will now be used in the following 
text in relation to DNA but it will be appreciated that the present invention is 
applicable to both RNA and DNA. 

Any exonuciease that digests polynucleotide from the 3 1 prime end to 
the 5 ! prime end or from both the 3* and the 5 1 end may be used. Examples of 
a suitable exonuciease which may be used in accordance with the present 
invention include BAL31 , and Exonuciease VII. 

The individual digested fragments are purified, mixed and reassembled 
with PCR technology. The assembled (reconstituted) gene may then be 
cloned into an expression vector for expressing the protein. The protein may 
then be analyzed for improved characteristics. 

The method of the present invention provides several advantages over 
known shuffling techniques. 

Further, the method of the present invention produces a set of 
progressively shortened DNA fragments for each time point a DNA sample Is 
taken from the BAL31 treatment. The DNA samples may be collected and 
pooled together or, optionally, individual samples may be chosen and used in 
the method. Thus the present invention allows a selection of what DNA 
samples are to be used In the recombination system and thereby offers a 
further degree of control. 

The method of the present invention may be carried out on any 
polynucleotide which codes for a particular product for example any protein 
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having binding or catalytical properties e.g. antibodies or parts of antibodies, 
enzymes or receptors. Further, any polynucleotide that has a function that 
may be altered for example catalytical RNA may be shuffled in accordance 
with the present invention, It Is preferable that the parent polynucleotide 
encoding one or more protein motif is at least 12 nucleotides in length, more 
preferably at least 20 nucleotides in length, even more preferably more than 
50 nucleotides in length. Polynucleotides being at least 100 nucleotides [n 
length or even at least 200 nucleotides in. length may be used. Where parent 
polynucleotides are used that encoded for large proteins such as enzymes or 
antibodies, these may be many hundreds or thousands of bases in length. 
The present invention may be carried out on any size of parent 
polynucleotide. 

The present invention also provides polynucleotide sequences 
generated by the method described above having desired characteristics. 
These sequences may be used for generating gene therapy vectors and 
replication-defective gene therapy constructs or vaccination vectors for DNA- 
based vaccinations. Further, the polynucleotide sequences may be used as 
research tools. 

The present invention also provides a polynucleotide library of 
sequences generated by the method described above from which a 
polynucleotide may be selected which encodes a protein having the desired 
characteristics. It is preferable that the polynucleotide library is a DNA or 
cDNA library. 

The present inventions also provides proteins such as enzymes, 
antibodies, and receptors having characteristics different to that of the wild 
type produced by the method described above. These proteins may be used 
individually or within a pharmaceutical^ acceptable carrier as vaccines or 
medicaments for therapy, for example, as Immunogens, antigens or otherwise 
in obtaining specific antibodies. They may also be used as research tools. 

The desired characteristics of a polynucleotide generated by the 
present invention or a protein encoded by a polynucleotide generated by the 
present invention may be any variation In the normal activity of the wild type 
(parent) polynucleotide or the polypeptide, protein or protein motifs it 
encodes. For example, it may be desirable to reduce or increase the catalytic 
activity of an enzyme, or improve or reduce the binding specificity of an 
antibody. Further, if the protein, or polynucleotide is an immunogen, it may 
be desirable to reduce or increase its ability to obtain specific antibodies 
against it. The parent polynucleotide preferably encodes one or more protein 
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motifs. These are defined by regions of polynucleotide sequence, that encode 
polypeptide sequence having or potentially having characteristic protein 
function. For example, a protein motif may define a portion of a whole protein, 
I.e. an epitope or a cleavage site or a catalytic site etc. However, within the 
scope of the present invention, an expressed protein motif does not have to 
display activity, or be "correctly" folded. 

It may be desirable to modify a protein so as to alter the conformation 
of certain epitopes, thereby improving its antigenicity and/or reducing cross- 
reactivity. For example, should such a protein be used as an antigen, the 
modification may reduce any cross-reaction of raised antibodies with similar 

. proteins. 

Although the term "enzyme" is used, this is to be interpreted as also 
including any polypeptide having enzyme-like activity, i.e. a catalytic function. 
For example, polypeptides being part of an enzyme may still possess catalytic 
function. Furthermore, proteins such as interferons and cytokines should be 
included. Likewise, the term "antibody" should be construed as covering any 
binding substance having a binding domain with the required specificity. This 
includes antibody fragments, derivatives, functional equivalents and 
homologues of antibodies, including synthetic molecules and . molecules 
whose shape mimics that of an antibody enabling it to bind an antigen or 
epitope. Examples of antibody fragments, capable of binding an antigen or 
other binding partner are Fab fragment consisting of the VL, VH, CI and CH1 
domains, the Fd fragment consisting of the VH and CH1 domains; the Fv 
fragment consisting of the VL and VH domains of a single arm of an antibody; 
the dAb fragment which consists of a VH domain; isolated CDR regions and 

/ F(ab')2 fragments, a bivalent fragment including two Fab fragments linked by 

a disulphide bridge at the hinge region. Single chain Fv fragments are also 
included. 

In order to obtain expression of the generated polynucleotide 
sequence, the sequence may be incorporated in a vector having control 
sequences operably linked to the polynucleotide sequence to control its 
expression. The vectors may include other sequences such as promoters or 
enhancers to drive the expression of the inserted polynucleotide sequence, 
further polynucleotide sequences so that the protein encoded for by the 
polynucleotide is produced as a fusion and/or nucleic acid encoding secretion 
signals so that the protein produced In the host cell is secreted from the cell. 
The protein encoded for by the polynucleotide sequence can then be 
obtained by transforming the vectors into host cells in which the vector is 



Exhibit A8 



9 



functional, culturing the host cells so that the protein is produced and 
recovering the protein from the host cells or the surrounding medium. 
Prokaryotic and eukaryotic cells are used for this purpose in the art, including 
strains of E. coli, yeast, and eukaryotic cells such as COS or CHO cells, The 
choice of host cell can be used to control the properties of the protein 
expressed in those cells, e.g. controlling where the protein Is deposited in the 
host cells or affecting properties such as its glycosylate. 

The protein encoded by the polynucleotide sequence may be 
expressed by methods well known in the art. Conveniently, expression may 
be achieved by growing a host cell in culture, containing such a vector, under 
appropriate conditions which cause or allow expression of the protein. 

Systems for cloning and expression of a protein in a variety of different 
host cells are well known. Suitable host cells include bacteria, eukaryotic 
cells such as mammalian and yeast, and baculovirus systems. Also, utilising 
the retrovirus system for cloning and expression is a good alternative, since 
this virus can be used together with a number of cell types. Mammalian cell 
lines available in the art for expression of a heterologous polypeptide include 
Chinese hamster ovary ceils, HeLa cells, baby hamster kidney cells, COS 
cells and many others. A common, preferred bacterial host is E. coli. 

Suitable vectors can be chosen or constructed, containing appropriate 
regulatory sequences, including promoter sequences, terminator fragments, 
polyadenyiation sequences, enhancer sequences, marker genes and other 
sequences as appropriate. Vectors may be plasmids, viral e.g. 'phage, or 
phagemid, as appropriate. For further details see, for example, Molecular 
Cloning: a Laboratory Manual: 2nd edition, Sambrook ef a/., 1989, Cold 
Spring Harbor Laboratory Press. Many known techniques and protocols for 
manipulation of polynucleotide sequences, for example in preparation of 
polynucleotide constructs, mutagenesis, sequencing, Introduction of DNA into 
cells and gene expression, and analysis of proteins, are described in detail in 
Current Protocols in Molecular Biology, Ausubel et a/, eds., John Wiley & 
Sons, 1992. 

The system can be used for the creation of DNA libraries comprising 
variable sequences which can be screened for the desired protein function in 
a number of ways. Enzyme function can be screened for with methods 
specific for the actual enzyme function eg. CMCase activity, p-glucosidase 
activity and also thermostability. Furthermore, phage display and cell surface 
display may be used for screening for enzyme function (Crameri A. et a/, 
Nature 1998 15; 391 (6664):288-291; Zhang J. H. ef a/, PNAS. USA 1997 94 
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(9): 4504-4509; Warren M.S. et a/, Biochemistry 1996, 9; 35(27): 8855-8862; 
Kim ef a/ Appl Environ Mlcroblok 66:788-93, 2000) as well as for altered 
binding properties of eg. antibodies (Griffith et a/., EMBO J. 113: 3245-3260, 
1994). 

A protein provided by the present invention may be used in screening 
for molecules which affect or modulate Its activity or function. Such 
molecules may be useful in a therapeutic (possibly including prophylactic) 
context. 

The present invention also provides vectors comprising polynucleotide 
sequences generated by the method described above. 

The present inventions also provides compositions comprising either 
polynucleotide sequences, vectors comprising the polynucleotide sequences 
or proteins generated by the method described above and a pharmaceutical^ 
acceptable carrier or a carrier suitable for research purposes. 

The present invention also provides a method comprising, following the 
identification of the polynucleotide or polypeptide having desired 
characteristics by the method described above, the manufacture of that 
polypeptide or polynucleotide in whole or in part, optionally in conjunction with 
additional polypeptides or polynucleotides. 

Following the identification of a polynucleotide or polypeptide having 
desired characteristics, these can then be manufactured to provide greater 
numbers by well known techniques such as PCR, cloning and expression 
within a host cell. 

The resulting polypeptides or polynucleotides may be used in the 
preparation of industrial enzymes eg. laundry detergent enzymes where an 
increased activity is preferred at lower temperatures. Alternatively, the 
manufactured polynucleotide or polypeptide may be used as a research tool, 
I.e. antibodies may be used in Immunoassays, and polynucleotides may be 
used as hybridization probes or primers. Alternatively, the resulting 
polypeptides or polynucleotides may be used in the preparation of 
medicaments for diagnostic use, pharmaceutical use, therapy etc. as 
discussed as follows, 

The polypeptides or polynucleotides generated by the method of the 
invention and identified as having desirable characteristics can be formulated 
in pharmaceutical compositions. These compositions may comprise, In 
addition to one of the above substances, a pharmaceutical^ acceptable 
excipient, carrier, buffer, stabilizer or other materials well known to those 
skilled in the art. Such materials should be non-toxic and should not interfere 
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with the efficacy of the active ingredient, The precise nature of the carrier or 
other material may depend on the route of administration, e.g. oral, 
Intravenous, cutaneous or subcutaneous, nasal, intramuscular, Intraperitoneal 
routes. 

Pharmaceutical compositions for oral administration may be in tablet, 
capsule, powder or liquid form. A tablet may include a solid carrier such as 
gelatin or an adjuvant Liquid pharmaceutical compositions generally include 
a liquid carrier such as water, petroleum, animal or vegetable oils, mineral oil 
or synthetic oil. Physiological saline solution, dextrose or other saccharide 
solution or glycols such as ethylene glycol, propylene glycol or polyethylene 
glycol may be Included. 

For intravenous, cutaneous or subcutaneous injection, or injection at 
the site of affliction, the active ingredient will be in the form of a parenteral^ 
acceptable aqueous solution which is pyrogen-free and has suitable pH, 
isotonicity and stability. Those of relevant skill in the art are well able to 
prepare suitable solutions using, for example, isotonic vehicles such as 
Sodium Chloride Injection, Ringer's Injection, Lactated Ringer's Injection. 
Preservatives, stabilizers, buffers, antioxidants and/or other additives may be 
included, as required. 

Whether it is a polypeptide, e.g. an antibody or fragment thereof, an 
enzyme, a polynucleotide or nucleic acid molecule, identified following 
generation by the present invention that is to be given to an individual, 
administration is preferably in a "prophylactically effective amount" or a 
"therapeutically effective amount" (as the case may be, although prophylaxis 
may be considered therapy), this being sufficient to show benefit to the 
individual. The actual amount administered, and rate and time-course of 
administration, will depend on the nature and severity of what is being 
treated. Prescription of treatment, e.g. decisions on dosage etc, is within the 
responsibility of general practitioners and other medical doctors, and typically 
takes account of the disorder to be treated, the condition of the individual 
patient, the site of delivery, the method of administration and other factors 
known to practitioners. Examples of the techniques and protocols mentioned 
above can be found in Remingtons Pharmaceutical Sciences, 16th edition, 
Osol.A. (ed), 1980. 

Alternatively, targeting therapies may be used to deliver the active 
agent more specifically to certain types of cell, by the use of targeting 
systems such as antibody or cell specific ligands. Targeting may be desirable 
for a variety of reasons; for example If the agent is unacceptably toxic, or if it 
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would otherwise require too high a dosage, or if it would not otherwise be 
able to enter the target cells. 

Instead of administering these agents directly, they could be produced 
in the target cells by expression from an encoding gene introduced Into the 
cells, e.g. in a viral vector (a variant of the VDEPT technique i.e. the 
activating agent, e.g. an enzyme, is produced in a vector by expression from 
encoding DNA in a viral vector). The vector could be targeted to the specific 
cells to be treated, or it could contain regulatory elements which are switched 
on more or less selectively by the target cells. 

Alternatively, the agent could be administered in a precursor form, for 
conversion to the active form by an activating agent produced in, or targeted 
to, the cells to be treated, This type of approach is sometimes known as 
ADEPT or VDEPT; the former Involving targeting the activating agent to the 
cells by conjugation to a cell-specific antibody, while the latter involves 
producing the activating agent, e.g. an enzyme, in a vector by expression 
from encoding DNA in a viral vector (see for example, EP-A-415731 and WO 
90/07936). 

A composition may be administered alone or in combination with other 
treatments, either simultaneously or sequentially dependent upon the 
condition to be treated. 

As a further alternative, the polynucleotide identified as having 
desirable characteristics following generation by the method of the present 
invention could be used in a method of gene therapy, to treat a patient who is 
unable to synthesize the active polypeptide encoded by the polynucleotide or 
unable to synthesize It at the normal level, thereby providing the effect 
provided by the corresponding wild-type protein. 

Vectors such as viral vectors have been used in the prior art to 
introduce polynucleotides into a wide variety of different target cells. Typically 
the vectors are exposed to the target cells so that transfection can take place 
in a sufficient proportion of the cells to provide a useful therapeutic or 
prophylactic effect from the expression of the desired polypeptide. The 
transfected nucleic acid may be permanently incorporated into the genome of 
each of the targeted tumour cells, providing long lasting effect, or alternatively 
the treatment may have to be repeated periodically. 

A variety of vectors, both viral vectors and plasmid vectors, are known 
in the art, see US Patent No. 5,252,479 and WO 93/07282. In particular, a 
number of viruses have been used as gene transfer vectors, including 
papovavlruses, such as SV40, vaccinia virus, herpes viruses, Including HSV 
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and EBV ? and retroviruses. Many gene therapy protocols In the prior art have 
used disabled murine retroviruses. 

As an alternative to the use of viral vectors other known methods of 
introducing nucleic acid Into cells Includes electroporation, calcium phosphate 
co-precipitation, mechanical techniques such as microinjection, transfer 
mediated by liposomes and direct DNA uptake and receptor-mediated DNA 
transfer. 

As mentioned above, the aim of gene therapy using nucleic acid 
encoding a polypeptide, or an active portion thereof, is to increase the 
amount of the expression product of the nucleic acid in cells in which the level 
of the wild-type polypeptide is absent or present only at reduced levels. Such 
treatment may be therapeutic in the treatment of cells which are already 
cancerous or prophylactic in the treatment of individuals known through 
screening to have a susceptibility allele and hence a predisposition to, for 
example, cancer. 

The present invention also provides a kit for generating a 
polynucleotide sequence or population of sequences of desired 
characteristics comprising reagents for ssDNA preparation, an exonuclease 
and components for carrying out a PCR technique, for example, thermostable 
DNA (nucleotides) and a stopping device, for example, EGTA. 

As outlined above the present Invention conveniently provides for the 
creation of mutated enzyme gene sequences and their random combination 
to functional enzymes having desirable characteristics. As an example of this 
aspect of the invention, the enzyme genes are mutated by error prone PCR 
which results in a mutation rate of approximately 0.7%. The resulting pool of 
mutated enzyme genes are then digested with an exonuclease, preferably 
BAL31, and the reaction inhibited by the addition of EGTA at different time 
points, resulting in a set of DNA fragments of different sizes. These may then 
be subjected to PGR based reassembly as described above. The resulting 
reassembled DNA fragments are then cloned and a gene library constructed. 
Clones may then be selected from this library and sequenced. 

A further application of this technology is the generation of a 
population of variable DNA sequences which can be used for further 
selections and analyses. Besides encoding larger proteins, e.g. antibody 
fragments and enzymes, the DNA may encode peptides where the molecules 
functional characteristics can bo used for the design of different selection 
systems. Selection of recomblned DNA sequences encoding peptides has 
previously been described (Fisch et al PNAS. USA 1996 Jul 23; 93 (15): 
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7761-7766); In addition, the variable DNA population can be used to produce 
a population of RNA molecules with e.g. catalytic activities. Vaish ef a/ 
(PNAS. USA 1998 Mar 3; 95 (5): 2158-2162) demonstrated the design of 
functional systems for the selection of catalytic RNA and Eckstein F (Clba 
Found. Symp. 1997; 209; 207-212) has outlined the applications of catalytic 
RNA by the specific introduction of catalytic RNA in cells. The system may be 
used to further search through the sequence space in the selection of 
functional peptides/molecules with catalytic activities based on recombined 
DNA sequences. 

Aspects and embodiments of the present invention will now be 
illustrated, by way of example, with reference to the accompanying figures. 
Further aspects and embodiments will be apparent to those skilled in the art. 
All documents mentioned in this text are incorporated herein by reference. 

Brief description of the drawings 

Figure 1 shows the principle of the method from template molecule to 
improved molecule; 

Figure 2 shows the principle steps in preparation of single stranded 
DNA using biotin; 

Figure 3 shows the principle steps in the preparation of single 
stranded DNA using phage; 

Figure 4 shows the principle steps generating single stranded DNA 
fragments using exonuclease treatment; 

Figure 5 shows the principle steps for assembly of single stranded 
DNA fragments using PCR. 

Detailed description and exemplification of the invention 

The DNA shuffling procedure can be illustrated by the steps shown in 
Figures 1-5. The gene encoding the protein of interest (X) In the plasmid 
pFab5chis is used in this example. Random mutations are introduced by error 
prone PCR. Single-stranded DNA is prepared. This can be carried out by 
either biotlnylated primers or by the use of phage being able to pack single- 
stranded DNA, as discussed above. The coding and the non-coding ssDNA 
strands are prepared in different reactions (A and B). The ssDNA strands 
from either reactions are subjected to separate enzymatic treatment using 
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e.g, BAL 31. By mixing the two pools of single-stranded DNA fragments in 
equimolar amount the gene can be reassembled in a shuffled nature and in 
many versions by the use of two subsequent PCR reaction, where the first 
reaction contains no primers. After cloning this library of reassembled genes 
in pY, selections can be performed to achieve the improved molecule of 
interest. 

A more detailed description of examples of the present invention is 
given below. 

Reagents: 

AmpliTaq® polymerase was purchased from Perkin-Elmer Corp., 
dNTPs from Boehringer Mannheim Biochemlca (Mannheim, Germany), and 
BAL31 Nuclease from New England Biolabs Inc. (Beverly, USA). All 
restriction enzymes were purchased from New England Biolabs Inc. (Beverly, 
USA). Ethidium bromide was purchased from Bio-Rad Laboratories (Bio-Rad 
Laboratories, Hercules, CA, USA). T4 DNA Ligase was purchased from New 
England Biolabs Inc. (Beverly, USA). 

All primers were designed in the laboratory and obtained from Life 
Technologies (Taby, Sweden) 

PCR: 

All Polymerase Chain Reactions (PCR) were carried out in a automatic 
thermocycler (Perkin-Elmer Cetus 480, Norwalk, CT,USA). PCR techniques 
for the amplification of nucleic acid are described in US Patent No. 4,683,195. 
The PCR reactions were run at varying amounts of cycles consisting of 
following profile: denaturation (94°C, 1 minute), primer annealing (55°C, 1 
minute) and extension (72°C, 1 minute) + 72 °C for 7 minutes. Each PCR 
reaction should contain 10 mM Tris-HCI, pH 8.3, 50 mM KCI, 1.5 mM MgCfe, 
200 fiM dNTP, 1 fiM forward primer, 1 /*M reverse primer, 1.25 U AmpliTaq® 
thermostable DNA polymerase (Perkin-Elmer Corp.), and 50 fmol template in 
a final volume of 100 pi, unless otherwise is noted. In all PCR experiments 
these parameters were used and the number of reaction cycles was varied. 
References for the general use of PCR techniques include Mullis et a/, Cold 
Spring Harbor Symp. Quant. Biol., 51:263, (1987), Ehrlich (ed), PCR 
technology, Stockton Press, NY, 1989, Ehrlich et a/, Science, 252:1643-1650, 
(1991), "PCR protocols; A Guide to Methods and Applications", Eds. Innis et 
a/, Academic Press, New York, (1990). 
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Sequencing: 

All constructs have been sequenced by the use of BlgDye Terminator 
Cycle Sequencing kit (Perkln-Elmer, Elmervlll, CA, USA). The sequencing 
was performed on a ABI Prism 377 DNA Sequencer. 

Agarose electrophoresis: 

Agarose electrophoresis of DNA was performed with 2% agarose gels 
(AGAROSE (FMC Byproducts, Rockland, ME, USA)) with 0.25^/ml ethldlum 
bromide in Tris-acetate buffer (TAE-buffer 0.04M Tris-acetate, 0.001 M 
EDTA). Samples for electrophoresis were mixed with a sterile filtrated loading 
buffer composed of 25% Ficoll and Brdmphenolic blue and loaded into wells 
in a the 2% agarose gel. The electrophoresis was run at 90 V for 45 minutes 
unless otherwise stated In Tris-acetate buffer with 0.25 /*g/ml ethidium 
bromide. Bands of appropriate size were gel-purified using the Qiaquick Gel 
Extraction Kit (QIagen GmbH, Hilden, Germany). As molecular weight 
standard, DNA molecular weight marker 1 kb ladder (Gibco BRL) was used. 
The DNA-concentratlon of the gel extracted products were estimated using a 
spectrophotometer. 

Bacterial Strains: 

The Escherichia constrain TOP10F was used as a bacterial host for 
transformations. Chemically competent cells of this strain were produced 
basically as described Hanahan, D. 1983. Studies on transformation of 
Escherichia coli with plasmids. J. Mol. Biol. 166: 557-580. Electrocompetent 
cells of this bacterial strain were produced (Dower, W.J., J. F. Miller, and 
C.W. Ragsdale. 1988: High efficiency transformation of E.coii by high voltage 
electroporation. Nucleic Acids Res. 16:6127). 

Plasmids: 

All genetic manupulations were performed in pFab5chis according to 
Molecular cloning; A LABORATORY MANUAL (Second Edition, Cold Spring 
Harbor Laboratory Press, 1989). 

Primers: 

Two blotlnylated primers surrounding the antibody gene of pFab5chls 
were designed with the following sequences including designated unique 
restriction sites: 
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1736 Sfil forward primer; 

5'-ATT ACT CGC GGC CCA GCC GGC CAT GGC CCA CAG GTC AAG CTC 
GA 

and 1735 Notl reversed primer: 

5'-TTA GAG CCT GCG GCC GCC TTG TCA TCG TCG TCC TT 

Two non-biotinylated primers surrounding the antibody gene of 
pFabSchis were designed with the following sequences including designated 
unique restriction sites: 

1664 Sfil forward primer: 

S'-ATT ACT CGC GGC CCA GCC GGC CAT GGC CCA CAG GTC AAG CTC 
GA 

and 1635 Notl reversed primer: 

5'-TTA GAG CCT GCG GCC GCC TTG TCA TCG TCG TCC TT 



Error Prone PGR: 

The error prone PCR reactions were carried out in a 10 x buffer 
containing 500 mM NaCI, 100 mM Tris-HCI, pH 8.8, 5mM MgCfe 100 fig 
gelatine (according to Kulpers et al Nucleic Acids Res. 1991, Aug 25; 19 
(16):4558) except for a raise in the MgCI 2 concentration from 2 mM to 5 mM). 

For each 100 ^l reaction the following was mixed: 

dATP5mM5/il 

dGTP5mM5/il 

dTTP10mM10/il 

dCTP10mM10^l 

20 M 3' primer 1 3fi\ 

20 M 5'-primer 1.5/xl 

10x Kuipers buffer 10 
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sterile mp H 2 0 46.3 /tl 

The template In pF ( ab5chis vector was added at an amount of 50 ng. 
10 /tl of 10 mM MnCI 2 was added and the tube was checked that no 
precipitation of Mn0 2 occurred. At last 5 Units of Taq enzyme was added. The 
error prone PCR was run at the following temperatures for 25 cycles without a 
hot start: 94°C 1\ 45 °C 1\ 72 °C 1' , + 72 °C for 7 minutes. The resulting 
product was an error proned insert over the protein of approximately 750 bp. 
This insert was purified with Gibco PCR purification kit, before further 
treatment. 



Generation of single stranded DNA by biotinylated prirrfrs 
The fragment of interest was amplified by two separate PCR reactions. These 
reactions can be normal PCR as desribed above or error prone PCR also as 
described above. The primers should be designed so that in one reaction the 
forward primer is biotinylated and In the other reaction the reverse primer is 
biotinylated. For example, PCR reactions with A) primers 1736 and 1635 and 
B) primers 1664 and 1735, with the above mentioned profile was performed 
for 25 cycles with pFab5chis-antibody as template. This yielded PCR- 
products of approximately 750 bp where in A the upper strand was 
biotinylated and in B the lower strand was biotinylated. 
The non-biotinylated strands are retrieved by purification using a solid matrix 
coated with streptavidin e.g AffiniTip Streptavldin Capture Micro-columns. The 
tips are equilibrated with Binding buffer containing 5 mM Tris pH 7.5, 1 M 
NaCI, 0.5 mM EGTA and 0.05% Tween 20. 50 /tl of each PCR product is 
mixed with 50 /tl 2 x Binding buffer and captured to separate equilibrated tip 
by the use of a micropipettor and incubated at room temperature for at least 
10 minutes. Unbound PCR products are removed by careful washing three 
times with 1000 fi\ Binding buffer and once, with 1000 /tl sterile water. The 
non-biotinylated strand of the captured DNA is eluted.by alkaline denaturation 
by letting the DNA incubate with 40 /tl 0.2 M NaOH, 2 mM EDTA for 10-15 
minutes. The solution is expelled from the tip by the use of a micropipettor 
and neutralized with 8 /tl 1 M HCI or 4 /tl 3 M sodium acetate pH 7.0-7.5. 

Generation of single stranded DNA using phage: 

The fragment of interest was cloned Into bacteriophage M13 vectors 

M13mp18 and M13mp19 using Pstl/Hindlll restriction enzymes. The 
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bacteriophage were propagated using Escherichia coii-strain TOP10F' 
according to conventional methods. Single stranded DNA for the upper strand 
was prepared from bacteriophage vector M13mp18 and single stranded DNA 
for the lower strand was prepared from bacteriophage vector Ml3mp19. 
Briefly, 1.5 ml of an infected bacterial culture was centrifuges at 12 OOOg for 5 
minutes at 4°C The superntant was precipitated with 200 /*! 20% 
PEG8000/2.5 M NaCI. The pelleted bacteriophage was resuspended in 100 
fi\ TE. 50 p\ phenol equilibrated with Tris-CI (pH 8.0) was added and the 
sample was vortexed. After centrifugation at 12 OOOg for 1 minute at RT the 
upper phase, containing the DNA, was transferred and precipitated with 
ethanoi. The DNA pellet was dissolved in 50 TE (pH 8,0) and stored at - 
20°C. (Maniatis et a/. Molecular Cloning, A laboratory manual 2 nd edition. 
) Cold Spring Habor Laboratory Press. 1989, chapter 4). Single stranded DNA 

prepared from phage is circular and must be opened prior to BAL31 
treatment. This can be performed with an endonuclease able to cleave single 
stranded DNA. 

Generation of single stranded fragmented DNA using BAL 31: 

The ssDNA strands from either reactions (containing upper and lower 
strands, respectively) were subjected to separate enzymatic treatment using 
e.g. BAL 31. Each digestion reaction should contain ssDNA, 600 mM NaCI, 
20 mM Tris-HCI, 12 mM CaCI2, 12 mM MgCI2, 1 mM EDTA pH 8.0 and BAL 
31 at various enzyme concentrations ranging from 0.1-5 U/m! : The reactions 
are incubated at 30°C and fractions of digested ssDNA are collected 
sequentially e.g. at 10, 30, 60 and 120 seconds or longer. The reactions are 
stopped by addition of EDTA and heat treatment at 65°C for 10 minutes. The 
ssDNA fragments are purified by phenol/cloroform extraction and ethanoi 
) percipitated. The ssDNA are resuspended in 10 mM Tris pH 8.0. 

The digestion pattern can be evaluated by 1 % agarose gel electrophoresis, 
see below. 

Purification of digestion produced fragments: 

Digested DNA fragments were purified by 
phenol/chloroform/isoamylalcohol extraction. 50 fi\ of buffered phenol was 
added to each tube of 100 p\ sample together with 50 p\ of a mixture of 
chloroform and Isoamylalcohol (24:1). The tubes were vortexed for 30 
seconds and then centrifuged for 1 minute In a mlcrofuge at 14000 r.p.m. The 
upper phase was then collected and mixed with 2.5 volumes of 99.5% 
Ethanoi (1/10 was 3M Sodium Acetate, pH 5.2). The DNA was precipitated for 
1 hour In -80 °C. The DNA was then pelleted by centrifugation for 30 minutes 
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[n a microfuge at 14.000 r.p.m. The pellet was washed once with 70% ethanol 
and then re-dissolved in 10 /xl of sterile water. 

Analysis of digestion produced purified fragments on agarose gel: 

5 fi\ of the dissolved pellet from each time point and from the blank 
were mixed with 2.5 of loading buffer (25% Ficoll and Bromphenolic blue) 
and loaded into wells in a 2% agarose gel. The electrophoresis of the 
different tlmepoints were performed as above. 

Reassembly of full length fragments: 

Reassembly of the ssDNA fragments is achieved by two sequential PCR 
reactions. The first PCR reaction should contain 10 mM Tris-HCI, pH 8.3, 50 
mM KCI, 1.5 mM MgCI 2 , 200 pM dNTP, 0.3 U Taq polymerase and 2 fi\ 
BAL31 treated sample, all in a final volume of 25 pi, and subjected to 5 cycles 
with the following profile: 94 °C for 1 minute, 50 °C for 1 minute and 72 °C for 
2 minutes + 72 °C for 5 minutes. The second PCR reaction should contain 
10 mM Tris-HCI, pH 8.3, 50 mM KCI, 1.5 mM MgCl a 200 /*M dNTP, 0.6 U Taq 
polymerase, 1 fiM forward primer, 1 ^M reverse primer, and 5 /tl sample from 
the first PCR reaction, all in a final volume of 50 fi t and subjected to 15 cycles 
with the following profile: 94 °C for 1 minute, 55 °C for 1 minute and 72 °C for 
2 minutes + 72 °C for 7 minutes. 

The resulting products can be evaluated by agarose gel electrophoresis. 

Restriction digestion of reassembled fragment and plasmid with Sfil and 
Notl: 

The reassembled fragment and the plasmid pFab5chis were first 
cleaved with Sfil by using NEB buffer 2 including BSA and 11 U enzyme//tg 
DNA. The reaction was carried out for 4 h at 50°C. After this the DNA was 
cleaved with Notl by adding conversion buffer and 61) enzyme/^g DNA. This 
reaction was carried out for 37 °C over night. 

Gel purification of restriction digested vector and restriction digested 
reassembled fragment: 

The cleavage reactions were analysed on a 1% agarose gel. The 
restriction digested insert showed a cleavage product of about 750 bp. This 
corresponds well with the expected size. The band of the cleaved insert and 
plasmid was cut out and gel-extracted as previously described. 
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